SUBMITTED TO THE IEEE TRANSACTIONS ON INFORMATION THEORY

Rate of Channel Hardening of Antenna 
Selection Diversity Schemes and Its 
Implication on Scheduling 

Dongwoon Bai, Patrick Mitran, Saeed S. Ghassemzadeh, Robert R. Miller, and Vahid Tarokh 

Abstract 

For a multiple antenna system, we compute the asymptotic distribution of antenna selection gain
when the transmitter selects the transmit antenna with the strongest channel. We use this to
asymptotically estimate the underlying channel capacity distributions, and demonstrate that, unlike
multiple-input/multiple-output (MIMO) systems, the channel for antenna selection systems hardens at a
slower rate, and thus a significant multiuser scheduling gain can exist: $\Theta(1/\log m)$ for channel
selection as opposed to $\Theta(1/\sqrt{m})$ for MIMO, where $m$ is the number of transmit antennas.
Additionally, even without this scheduling gain, it is demonstrated that transmit antenna selection
systems outperform open loop MIMO systems in low signal-to-interference-plus-noise ratio (SINR)
regimes, particularly for a small number of receive antennas. This may have some implications for
wireless system design, because most of the users in modern wireless systems have low SINRs.

I. Introduction 

The use of multiple transmit antennas has been studied for wireless links because of its promise of 
high spectral efficiency. When the receiver has full channel state information (CSI), the capacity of a 
MIMO channel is typically calculated under the assumption that either the transmitter has full CSI (closed 
loop MIMO) or no CSI (open loop MIMO) [1]. In both cases, in order to achieve data rates close to 
capacity, the implementation of various signal processing and RF units is needed. The underlying costs
may increase as the number of antennas increases. For some applications, this cost is prohibitive and
motivates the studies of alternative antenna technologies.

Antenna selection schemes are attractive, since they can reduce hardware costs dramatically [2]-[7]. 
Some proposals consider the selection of more than one antenna and require more than one transmit 
chain. Nonetheless, in this paper, we are mainly interested in a transmit antenna selection scheme that 
selects the best channel among the transmit antennas. We will demonstrate that under this scheme, there
is no channel hardening, and thus significant multiuser scheduling gain can exist. This is unlike MIMO 
systems [8], where the asymptotic scheduling gain is zero and there is significant channel hardening. 

To this end, we compute the asymptotic distribution of the selection gain and use this to asymptotically 
estimate the underlying channel capacity distributions. We note that the exact distribution of the selection 
gain has been computed in the literature and the channel capacity has been numerically and explicitly 
(as a series expansion) calculated. However, these exact values are not insightful in predicting if channel 
hardening exists or not, let alone the rate of hardening. For this purpose, we will invoke the theory of 
extreme order statistics assuming Rayleigh fading channels. 

The outline of the paper is given next. In Section II we present our system model. In Section III we
calculate the asymptotic distribution of selection gains and outage capacity gains. In Section IV we obtain
upper and lower bounds for the ergodic capacity. These results demonstrate that channel hardening occurs
at a much slower rate than for MIMO, and thus significant multiuser scheduling gain can exist. Additionally,
we will show that even without this scheduling gain, transmit antenna selection can outperform an open
loop MIMO system in the low SINR regime for a small number of receive antennas. In Section V we
compute the scheduling gain. Finally, in Section VI we present our conclusions and final comments,
while most of the proofs may be found in the Appendix.

II. The System Model 

We consider an $n \times m$ MIMO channel model, with $m$ transmit and $n$ receive antennas. The input-output
relation is given by

$$y = Hs + w.$$ (1)

The matrix $H$ represents the channel matrix, and is assumed to be known at the receiver. The $m \times 1$
complex vector $s$ is the transmitted signal vector, the $n \times 1$ vector $y$ represents the received signal, and
$w$ is an $n \times 1$ zero-mean i.i.d. circularly symmetric complex Gaussian noise vector, with covariance
matrix $E[ww^\dagger] = I_n$. Interference, if any, is assumed to be absorbed in $w$. An average transmit power
constraint $E[s^\dagger s] \le \rho$ is assumed, where $\rho > 0$.

Assuming that a transmit antenna selection system chooses to transmit only on the $l$-th antenna
($l \in \{1, \dots, m\}$), the capacity is given by

$$C(H, l) = \log_2\Bigl(1 + \rho \sum_{i=1}^{n} |H_{il}|^2\Bigr).$$ (2)

Let $X_l = \sum_{i=1}^{n} |H_{il}|^2$; then the best selection strategy is to choose antenna

$$l^* = \arg\max_l \left[\log_2(1 + \rho X_l)\right] = \arg\max_l X_l$$ (3)

for transmission. This scheme is of interest, since it eliminates the need to feed back the channel matrix 
H. In fact, the receiver needs only to feed back the index of the best transmit antenna to the transmitter, 
requiring only log2 m bits of feedback information. 
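
As an illustration of the selection rule (3) and of the feedback requirement, the following is a minimal simulation sketch. The function names (`selection_gains`, `select_antenna`, `feedback_bits`, `capacity`) are hypothetical and not part of the paper, and rounding the feedback up to a whole number of bits when $m$ is not a power of two is an assumption of this sketch.

```python
import math
import random

def selection_gains(n, m, rng):
    """Draw X_l = sum_i |H_il|^2 for l = 0..m-1, with i.i.d. complex
    Gaussian entries of variance 1/2 per real component."""
    sigma = math.sqrt(0.5)
    gains = []
    for _ in range(m):
        x = 0.0
        for _ in range(n):
            re, im = rng.gauss(0.0, sigma), rng.gauss(0.0, sigma)
            x += re * re + im * im
        gains.append(x)
    return gains

def select_antenna(gains):
    """Rule (3): index (0-based) of the antenna with the largest X_l."""
    return max(range(len(gains)), key=gains.__getitem__)

def feedback_bits(m):
    """Whole bits needed to signal one of m indices (assumption: round up)."""
    return (m - 1).bit_length()

def capacity(rho, x):
    """Capacity (2) in bits for a given selection gain x."""
    return math.log2(1.0 + rho * x)

if __name__ == "__main__":
    rng = random.Random(1)
    gains = selection_gains(n=2, m=8, rng=rng)
    best = select_antenna(gains)
    print(best, feedback_bits(8), capacity(10 ** 0.5, gains[best]))
```

Because the logarithm in (2) is increasing, maximizing the capacity and maximizing $X_l$ pick the same antenna, which is why only the index needs to be fed back.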

We are interested in whether there is asymptotic channel hardening for such a system, i.e., whether or
not the underlying scheduling gain asymptotically goes to zero, and the rate at which this occurs. To this
end, we assume that $H$ has independent zero-mean complex Gaussian entries with variance 1/2 per real
component. This is a flat fading channel model. Thus $X_l$ has a chi-square distribution with $2n$ degrees of
freedom and $X_l$ is independent of $X_{l'}$ whenever $l \ne l'$.

For any set of i.i.d. random variables $Z_i$, we use the notation $Z_{(m)}$ to denote

$$Z_{(m)} = \max\{Z_1, \dots, Z_m\}.$$

Using this notation, the expected received SINR for the transmit antenna selection strategy is given by
$\rho \cdot E[X_{(m)}]$. The asymptotic distribution of $X_{(m)}$ for large $m$ is of interest, since the ergodic capacity of the
selection scheme with the optimal choice of transmit antenna is given by

$$E_H[C(H, l^*)] = E_{X_{l^*}}\left[\log_2(1 + \rho X_{l^*})\right] = E_{X_{(m)}}\left[\log_2(1 + \rho X_{(m)})\right].$$ (4)

We note that other measures of performance can also be derived based on the distribution of $X_{(m)}$.

III. Order Statistics of the Chi-Square Distribution 

For any $n \ge 1$, the probability density function (pdf) and the cumulative distribution function (cdf) of
the chi-square random variable $X_l \ge 0$ with $2n$ degrees of freedom are respectively given by

$$f(x) = e^{-x}\frac{x^{n-1}}{(n-1)!}$$ (5)

and

$$F(x) = 1 - e^{-x}\sum_{k=0}^{n-1}\frac{x^k}{k!},$$ (6)

for $x \ge 0$. Clearly, the upper endpoint $\omega(F) = \sup\{x \mid F(x) < 1\}$ is infinity.






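Let $F_{(m)}$ and $f_{(m)}$ denote the cdf and the pdf of $X_{(m)}$. It is well known that

$$F_{(m)}(x) = F^m(x) = \Bigl(1 - e^{-x}\sum_{k=0}^{n-1}\frac{x^k}{k!}\Bigr)^m$$ (7)

and

$$f_{(m)}(x) = \frac{d}{dx}F_{(m)}(x) = m\,F^{m-1}(x)\,f(x) = m\Bigl(1 - e^{-x}\sum_{k=0}^{n-1}\frac{x^k}{k!}\Bigr)^{m-1} e^{-x}\frac{x^{n-1}}{(n-1)!}$$ (8)

respectively, for $x \ge 0$ [9, pp. 9-11]. Clearly $F_{(m)}(x) = 0$ and $f_{(m)}(x) = 0$ for $x < 0$. The ergodic
capacity of transmit antenna selection can be computed from the above, by numerical integration of (4).
Analytical solutions seem to be hard to obtain and offer no insights into channel hardening and scheduling
gain for such a system.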
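
The numerical integration mentioned above can be sketched as follows. This is only an illustrative implementation under stated assumptions: the helper names are hypothetical, composite Simpson's rule is one possible quadrature among many, and the integration range is truncated at an assumed cutoff beyond which the tail of $f_{(m)}$ is negligible for small $n$.

```python
import math

def cdf(x, n):
    """F(x) of eq. (6): 1 - exp(-x) * sum_{k=0}^{n-1} x^k / k!."""
    if x <= 0.0:
        return 0.0
    term = total = 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return 1.0 - math.exp(-x) * total

def pdf(x, n):
    """f(x) of eq. (5): exp(-x) * x^(n-1) / (n-1)!."""
    if x < 0.0:
        return 0.0
    return math.exp(-x) * x ** (n - 1) / math.factorial(n - 1)

def pdf_max(x, n, m):
    """f_(m)(x) of eq. (8): m * F(x)^(m-1) * f(x)."""
    return m * cdf(x, n) ** (m - 1) * pdf(x, n)

def expectation(g, n, m, steps=20000):
    """E[g(X_(m))] by composite Simpson's rule on [0, upper]."""
    upper = n + math.log(m) + 40.0  # assumed cutoff; neglected tail is tiny
    h = upper / steps               # steps must be even for Simpson's rule
    total = g(0.0) * pdf_max(0.0, n, m) + g(upper) * pdf_max(upper, n, m)
    for i in range(1, steps):
        x = i * h
        total += (4 if i % 2 else 2) * g(x) * pdf_max(x, n, m)
    return total * h / 3.0

def ergodic_capacity(rho, n, m):
    """Ergodic capacity (4) in bits, by numerical integration."""
    return expectation(lambda x: math.log2(1.0 + rho * x), n, m)

if __name__ == "__main__":
    for m in (1, 2, 5, 10, 20):
        print(m, round(ergodic_capacity(10 ** 0.5, 1, m), 4))
```

For $n = 1$ the same routine can be checked against the mean of the maximum of $m$ exponential variables, $\sum_{k=1}^{m} 1/k$, by calling `expectation(lambda x: x, 1, m)`.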

A. A Key Convergence Result 

First, we have the following technical result whose proof may be found in the Appendix.
Lemma 1: Let $F(\cdot)$ and $\omega(F)$ be as defined above. Then

1) For any $t \ge 0$, the value $R(t) = \int_t^{\omega(F)} (1 - F(y))\,dy \,/\, (1 - F(t))$ is well defined and satisfies
$1 \le R(t) < \infty$. As $t \to \infty$,

$$R(t) = \begin{cases} 1, & \text{if } n = 1, \\ 1 + (n-1)\frac{1}{t} + O\bigl(\frac{1}{t^2}\bigr), & \text{if } n \ge 2. \end{cases}$$ (9)

2) $F(\cdot)$ is in the domain of attraction of the Gumbel distribution. That is to say, for all fixed real $x$,
as $m \to \infty$,

$$F^m(a_m + b_m x) \to G(x) = \exp(-e^{-x}),$$ (10)

where $G(x)$ is the Gumbel cdf, and the normalizing constants $a_m$ and $b_m$ can be selected to be

$$a_m = q_m = F^{-1}(1 - 1/m)$$ (11)

$$b_m = R(q_m).$$ (12)

□

In the above lemma, other choices of $a_m$ and $b_m$ are also possible. We will study this next.
Notation: For any two real-valued sequences $c_m$ and $d_m$, we define

$c_m \approx d_m$ if and only if $\lim_{m\to\infty} |c_m - d_m| = 0$;
$c_m = \Theta(d_m)$ if and only if $0 < \lim_{m\to\infty} c_m/d_m < \infty$.

Lemma 2: For $F(\cdot)$ and $G(\cdot)$ defined as above,

$$\lim_{m\to\infty} F^m(a_m + b_m x) = G(x)$$

if and only if the normalizing constants $a_m$ and $b_m$ satisfy

$$a_m \approx \ln m + (n-1)\ln(\ln m) - \ln (n-1)!$$ (13)
$$b_m \approx 1.$$ (14)

□
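
To get a feel for how slowly the quantile $q_m$ of (11) approaches the closed-form constant in (13), the sketch below computes both numerically. The function names are hypothetical, and bisection is simply one convenient way (an assumption of this sketch) to invert $F$.

```python
import math

def tail(x, n):
    """1 - F(x) = exp(-x) * sum_{k=0}^{n-1} x^k / k!, from eq. (6)."""
    if x <= 0.0:
        return 1.0
    term = total = 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total

def quantile_q(m, n):
    """q_m = F^{-1}(1 - 1/m) of eq. (11), found by bisection."""
    lo, hi = 0.0, 1.0
    while tail(hi, n) > 1.0 / m:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if tail(mid, n) > 1.0 / m:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def lemma2_constant(m, n):
    """ln m + (n-1) ln(ln m) - ln (n-1)!, the right-hand side of (13); needs m > 1."""
    return math.log(m) + (n - 1) * math.log(math.log(m)) - math.lgamma(n)

if __name__ == "__main__":
    for m in (10, 10**2, 10**4, 10**6):
        print(m, quantile_q(m, 2) - lemma2_constant(m, 2))
```

For $n = 2$ the printed gap shrinks only logarithmically in $m$, which is consistent with the remark in the next subsection that the simple choice of constants fits poorly for small $m$.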

Theorem 3: The variance of $X_{(m)}$ is bounded away from 0, i.e., the effective channel will exhibit
significant fluctuations.

Proof: It is known that the Gumbel distribution has mean $\gamma = 0.5772\ldots$ (Euler's constant) and
variance $\pi^2/6$ ([9, p. 298]). Thus the mean and variance of $X_{(m)}$ will approach $a_m + b_m\gamma$ and $b_m^2\pi^2/6$,
respectively, as $m$ increases. Hence, since $b_m \approx 1$, we have

$$E[X_{(m)}] \approx a_m + \gamma,$$ (15)

$$\mathrm{Var}[X_{(m)}] \approx \frac{\pi^2}{6}.$$ (16)

Because the variance is neither zero for any $m$ nor goes near zero as $m$ increases, the channel will
fluctuate considerably. ■
This suggests that the scheduling gain may go to zero considerably slower than for MIMO. The
scheduling gain of the above antenna-selection system is computed in Section V.

B. Optimizing the Rate of Convergence 

In this subsection, we study the best pairs of normalizing constants $a_m$ and $b_m$ that provide an accurate
fit to the Gumbel distribution, even for relatively small values of $m$. For example, the simple choice of
$a_m = \ln m + (n-1)\ln(\ln m) - \ln (n-1)!$ and $b_m = 1$ yields a poor estimate of the distribution for
small $m$, and may not yield accurate results for a realistic number of transmit antennas.

It is convenient to first introduce the sequence

$$\alpha_m = \max\Bigl\{ x > 0 \;\Big|\; e^{-x}\frac{x^{n-1}}{(n-1)!} = \frac{1}{m} \Bigr\}$$ (17)

which is well defined for all $m$ greater than some $M(n)$. Clearly, $\alpha_m \to \infty$ as $m \to \infty$.

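Theorem 4: Let $F(\cdot)$ and $G(\cdot)$ be as given above. Consider the rate of convergence (for fixed $x$) of

$$|F^m(a_m + b_m x) - G(x)|,$$ (18)

as $m \to \infty$.

1) The optimal sequences of constants $a_m$ and $b_m$ minimizing this rate are

$$a_m = \begin{cases} \alpha_m, & \text{if } n = 1, \\ \alpha_m + (n-1)\frac{1}{\alpha_m} + O\bigl(\frac{1}{\alpha_m^2}\bigr), & \text{if } n \ge 2, \end{cases}$$ (19)

and

$$b_m = \begin{cases} 1, & \text{if } n = 1, \\ 1 + (n-1)\frac{1}{\alpha_m} + O\bigl(\frac{1}{\alpha_m^2}\bigr), & \text{if } n \ge 2. \end{cases}$$ (20)

2) For $n = 1$, the optimal rate of convergence for any given $x$ is

$$|F^m(a_m + b_m x) - G(x)| = \Theta\Bigl(\frac{1}{m}\Bigr),$$ (21)

and for $n \ge 2$,

$$|F^m(a_m + b_m x) - G(x)| = \Theta\Bigl(\frac{1}{(\log m)^2}\Bigr).$$ (22)

□

Corollary 5: For $F(\cdot)$, $G(\cdot)$, and $q_m$ as given above, the choice of $a_m = q_m$ and

$$b_m = \begin{cases} 1, & \text{if } n = 1, \\ 1 + (n-1)\frac{1}{q_m} + O\bigl(\frac{1}{q_m^2}\bigr), & \text{if } n \ge 2 \end{cases}$$ (23)

also satisfies (19) and (20), and this is therefore optimal in the sense of minimizing the rate of convergence
defined in (18). □
The proofs may be found in the Appendix.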
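
The relation between $\alpha_m$ of (17) and the quantile $q_m$ that underlies Corollary 5 can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, both quantities are found by bisection (an implementation choice of this sketch), and the final comparison uses the fact, implied by (19) with $a_m = q_m$, that $q_m - \alpha_m$ equals $(n-1)/\alpha_m$ up to a term of order $1/\alpha_m^2$.

```python
import math

def tail(x, n):
    """1 - F(x) = exp(-x) * sum_{k=0}^{n-1} x^k / k!, from eq. (6)."""
    term = total = 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total

def bisect_increasing(g, lo, hi):
    """Root of an increasing function g on [lo, hi], with g(lo) <= 0 <= g(hi)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def quantile_q(m, n):
    """q_m of eq. (11): solves 1 - F(q) = 1/m."""
    g = lambda x: 1.0 / m - tail(x, n)  # increasing in x
    hi = 1.0
    while g(hi) < 0.0:
        hi *= 2.0
    return bisect_increasing(g, 0.0, hi)

def alpha(m, n):
    """alpha_m of eq. (17): largest x with exp(-x) x^(n-1)/(n-1)! = 1/m."""
    if n == 1:
        return math.log(m)
    # Log form of (17): x - (n-1) ln x + ln (n-1)! - ln m = 0, increasing for x > n-1.
    g = lambda x: x - (n - 1) * math.log(x) + math.lgamma(n) - math.log(m)
    lo = float(n - 1)
    if g(lo) > 0.0:
        raise ValueError("alpha_m is undefined: m is not larger than M(n)")
    hi = lo + 1.0
    while g(hi) < 0.0:
        hi *= 2.0
    return bisect_increasing(g, lo, hi)

if __name__ == "__main__":
    n = 2
    for m in (10**2, 10**4, 10**6):
        a, q = alpha(m, n), quantile_q(m, n)
        print(m, q - a, (n - 1) / a)
```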

From (9) and Corollary 5, we can check that the choice of normalizing constants in Lemma 1 ($a_m = q_m$,
$b_m = R(q_m)$) is optimal. Fig. 1 shows that the Gumbel approximation is an excellent fit with this choice
of normalization coefficients. It can also be seen from the figure that the variance of $X_{(m)}$ stays bounded
away from zero as $m \to \infty$.
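
The quality of the Gumbel fit with $a_m = q_m$ and $b_m = R(q_m)$ can also be probed numerically. The following sketch measures the largest gap between the exact cdf $F^m(x)$ and the Gumbel approximation on a grid. The function names, the grid step, and the grid range are assumptions made for illustration; $R(t)$ is evaluated through the ratio of finite sums given in (43) of the Appendix.

```python
import math

def tail(x, n):
    """1 - F(x) = exp(-x) * sum_{k=0}^{n-1} x^k / k!, from eq. (6)."""
    if x <= 0.0:
        return 1.0
    term = total = 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total

def quantile_q(m, n):
    """q_m = F^{-1}(1 - 1/m) of eq. (11), by bisection."""
    lo, hi = 0.0, 1.0
    while tail(hi, n) > 1.0 / m:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if tail(mid, n) > 1.0 / m:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def mean_residual(t, n):
    """R(t) as the ratio of sums in eq. (43)."""
    num = den = inner = 0.0
    term = 1.0
    for k in range(n):
        if k > 0:
            term *= t / k
        inner += term  # sum_{i=0}^{k} t^i / i!
        num += inner
        den += term
    return num / den

def max_gumbel_error(m, n, step=0.01):
    """Largest |F^m(x) - G((x - a_m)/b_m)| on a grid over [0, a_m + 10 b_m]."""
    a, b = quantile_q(m, n), mean_residual(quantile_q(m, n), n)
    worst, x = 0.0, 0.0
    while x <= a + 10.0 * b:
        exact = (1.0 - tail(x, n)) ** m
        approx = math.exp(-math.exp(-(x - a) / b))
        worst = max(worst, abs(exact - approx))
        x += step
    return worst

if __name__ == "__main__":
    for n in (1, 2, 5):
        print(n, [round(max_gumbel_error(m, n), 4) for m in (2, 5, 10, 20)])
```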

C. Outage Capacity 

By using the above results, given a rate $C_0$, the corresponding outage probability $P_{\mathrm{out}}(C_0)$ can be
approximated by

$$P_{\mathrm{out}}(C_0) = \Pr\{\log_2(1 + \rho X_{(m)}) < C_0\} = F^m\Bigl(\frac{2^{C_0} - 1}{\rho}\Bigr) \approx \exp\Bigl(-\exp\Bigl(-\frac{(2^{C_0} - 1)/\rho - a_m}{b_m}\Bigr)\Bigr).$$ (24)

Fig. 1. Exact and Gumbel-approximate ($a_m = q_m$, $b_m = R(q_m)$) distributions of the selection gain $X_{(m)}$
for $m = 2, 5, 10, 20$ and $n = 1, 2, 5$.



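The outage capacity $C_{\mathrm{out}}(P_0)$ can then be approximated as

$$C_{\mathrm{out}}(P_0) = P_{\mathrm{out}}^{-1}(P_0) = \log_2\bigl[1 + \rho F^{-1}(P_0^{1/m})\bigr] \approx \log_2\bigl[1 + \rho\{a_m - b_m \ln(-\ln P_0)\}\bigr].$$ (25)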
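
As a small worked instance of (25), the sketch below restricts attention to $n = 1$, where everything is available in closed form: $F^{-1}(u) = -\ln(1 - u)$, $a_m = q_m = \ln m$ and $b_m = R(q_m) = 1$. The function names are hypothetical, and clamping the approximate gain at zero (it can become negative for very small $m$) is an assumption of this sketch rather than part of the analysis.

```python
import math

def outage_capacity_exact(p0, rho, m):
    """Exact expression in (25) for n = 1, using F^{-1}(u) = -ln(1 - u)."""
    gain = -math.log(1.0 - p0 ** (1.0 / m))
    return math.log2(1.0 + rho * gain)

def outage_capacity_gumbel(p0, rho, m):
    """Gumbel approximation in (25) for n = 1 (a_m = ln m, b_m = 1)."""
    a_m, b_m = math.log(m), 1.0
    gain = a_m - b_m * math.log(-math.log(p0))
    return math.log2(1.0 + rho * max(gain, 0.0))  # clamp: assumption for small m

if __name__ == "__main__":
    rho = 10 ** 0.5  # 5 dB
    for m in (2, 5, 10, 20):
        print(m, outage_capacity_exact(0.1, rho, m), outage_capacity_gumbel(0.1, rho, m))
```

For 10% outage at 5 dB, the two values differ by roughly 0.03 bits at $m = 20$, while the approximation is poor at $m = 2$.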



Fig. 2 shows the 10% outage capacity of transmit antenna selection and MIMO without feedback for SINR
$\rho = 5$ dB. The normalizing constants for the approximations are chosen to be the same as those in Fig. 1.
The figure indicates that the above approximations improve as $m$ increases. Additionally, even ignoring
the scheduling gain, the transmit antenna selection scheme outperforms full open loop MIMO schemes
in terms of outage capacity, when the number of receive antennas is small.

Fig. 2. 10% outage capacity of transmit antenna selection and MIMO versus $m$ for $n = 1, 2, 3$ at $\rho = 5$ dB.



IV. Ergodic Capacity 

A. Some Useful Bounds 

Theorem 6: For a random variable $X$ with cdf $F(\cdot)$ as above,

$$q_m \le E[X_{(m)}] \le q_{e^\gamma(m+1)},$$ (26)

where $\gamma$ is Euler's constant ($e^\gamma = 1.7810\ldots$), $q_m$ is the quantile defined in (11), and
$q_{e^\gamma(m+1)} = F^{-1}\bigl(1 - 1/(e^\gamma(m+1))\bigr)$. The ergodic capacity $C(\rho) = E[\log_2(1 + \rho X_{(m)})]$ also satisfies

$$\log_2(1 + \rho\, q_m) \le C(\rho) \le \log_2(1 + \rho\, q_{e^\gamma(m+1)}),$$ (27)

for any $\rho > 0$. □
The proof may be found in the Appendix.

From Jensen's inequality and the right inequality in (26), we obtain

$$\log_2(1 + \rho\, q_m) \le C(\rho) \le \log_2(1 + \rho\, E[X_{(m)}]) \le \log_2(1 + \rho\, q_{e^\gamma(m+1)}).$$ (28)

From the fact that $q_{e^\gamma(m+1)} - q_m \approx \gamma$, we can see that

$$\log_2(1 + \rho\, q_{e^\gamma(m+1)}) \approx \log_2(1 + \rho\, q_m).$$ (29)

It follows that

$$C(\rho) \approx \log_2(1 + \rho\, E[X_{(m)}])$$ (30)

$$\approx \log_2(1 + \rho\,(q_m + \gamma)).$$ (31)
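
The bounds (27) and the approximation (31) are easy to evaluate when $n = 1$, since then $q_m = \ln m$ and $q_{e^\gamma(m+1)} = \ln(m+1) + \gamma$. The sketch below compares them with a numerically integrated ergodic capacity. It is illustrative only: the function names are hypothetical, the restriction to $n = 1$ is a simplification of this sketch, and Simpson's rule with an assumed truncation point stands in for any suitable quadrature.

```python
import math

EULER_GAMMA = 0.5772156649015329

def capacity_bounds(rho, m):
    """Lower and upper bounds of (27) for n = 1."""
    q_low = math.log(m)                     # q_m = ln m
    q_up = math.log(m + 1) + EULER_GAMMA    # F^{-1}(1 - 1/(e^gamma (m+1)))
    return math.log2(1.0 + rho * q_low), math.log2(1.0 + rho * q_up)

def capacity_approx(rho, m):
    """Approximation (31) for n = 1."""
    return math.log2(1.0 + rho * (math.log(m) + EULER_GAMMA))

def capacity_exact(rho, m, steps=20000):
    """E[log2(1 + rho X_(m))] for n = 1 by composite Simpson's rule."""
    upper = math.log(m) + 40.0  # assumed cutoff; neglected tail is tiny
    h = upper / steps

    def integrand(x):
        # log2(1 + rho x) * m (1 - e^{-x})^{m-1} e^{-x}
        return math.log2(1.0 + rho * x) * m * (1.0 - math.exp(-x)) ** (m - 1) * math.exp(-x)

    total = integrand(0.0) + integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0

if __name__ == "__main__":
    rho = 10 ** 0.5  # 5 dB
    for m in (2, 4, 10, 20):
        low, up = capacity_bounds(rho, m)
        print(m, low, capacity_exact(rho, m), capacity_approx(rho, m), up)
```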

B. Asymptotic Analysis for Large Number of Receive Antennas 

A chi-square random variable $X$ with cdf $F(\cdot)$ as above is the sum of $2n$ i.i.d. random variables, each
with mean and variance equal to 0.5. In order to study the change of $F(\cdot)$ and $q_m$ as functions of $n$, it will
be convenient to write $F_n(\cdot)$ and $q_m(n)$ respectively. As $n$ increases, by the central limit theorem,
$(X - n)/\sqrt{n}$ converges to the Gaussian distribution with mean zero and variance one. Thus, for large $n$,

$$q_m(n) = F_n^{-1}\Bigl(1 - \frac{1}{m}\Bigr) = n + O(\sqrt{n}).$$ (32)

By Theorem 6 and (31),

$$C(\rho) \approx \log_2\bigl(1 + \rho\,(n + O(\sqrt{n}))\bigr).$$ (33), (34)

C. Numerical Results

Fig. 3 demonstrates the ergodic capacity of various systems for a few basic antenna configurations. It
is observed that the transmit antenna selection scheme performance is superior to an open loop MIMO
scheme in the low SINR regime. This may have some implications for wireless system design, as most
of the users in modern wireless systems have low SINRs. In fact, the ergodic capacity of open loop
MIMO is upper bounded by $n\log_2(1 + \rho)$, while the ergodic capacity of transmit antenna selection goes
to infinity as $m \to \infty$, and is not upper bounded.

In Figures 4 and 5 we study the ergodic capacity as a function of $m$ and $n$, respectively, for SINR
$\rho = 5$ dB. It is observed that our bounds in (27) and approximation in (31) for the ergodic capacity
of transmit antenna selection schemes are very good and become exact as $m$ increases. In terms of the
ergodic capacity, it is also seen that transmit antenna selection outperforms open loop MIMO when the
number of receive antennas is small. If more than two receive antennas are deployed, it appears that
open loop MIMO is better than transmit antenna selection for a moderate number of transmit antennas
($m = 1, \dots, 20$) at SINR $\rho = 5$ dB. Nonetheless, in lower SINR regimes, transmit antenna selection
will outperform open loop MIMO even for more than two receive antennas.

Fig. 3. Ergodic capacity versus $\rho$ for $m = 1, 2$ and $n = 1, 2$.

V. Scheduling 

For a single cell with multiple users, many scheduling strategies have been proposed. Among them, it 
is known that a greedy scheduling algorithm maximizes the total system capacity. In greedy scheduling, 
the base station selects the user with the best channel at any given time. Only this user may communicate 
with the base station. 

It is known that multiple transmit antennas in MIMO reduce channel fluctuations and thus the benefits of 
scheduling decrease as the number of transmit antennas increases [8]. However, that is not necessarily the 
case for transmit antenna selection because there are significant channel fluctuations even after deploying 
a large number of transmit antennas. We compare the system capacity of greedy scheduling for antenna 
selection to that of round robin scheduling for antenna selection as well as greedy and round-robin 
scheduling for MIMO. Our basic assumption for the analysis is that all users have the same number of
antennas.

Fig. 4. Ergodic capacity versus $m$ for $n = 1, 2, 3$ at $\rho = 5$ dB.

We define the capacity of a scheduling algorithm, where there are $K$ users and each user is equipped
with $n$ antennas in the case of downlink or $m$ antennas in the case of uplink, as the average system capacity
after scheduling.

The greedy scheduling capacity is then the same as the ergodic capacity of transmit antenna selection
with $mK$ transmit antennas, and it is given by

$$E[\log_2(1 + \rho X_{(mK)})] \approx \log_2(1 + \rho\,(q_{mK} + \gamma)),$$ (35)

using (31). From (27), this is lower and upper bounded by $\log_2(1 + \rho\, q_{mK})$ and $\log_2(1 + \rho\, q_{e^\gamma(mK+1)})$,
respectively. Note that round robin scheduling has the same capacity as the ergodic capacity of a
point-to-point link with the same number of transmit antennas and is given by

$$E[\log_2(1 + \rho X_{(m)})] \approx \log_2(1 + \rho\,(q_m + \gamma)),$$ (36)

and this is upper and lower bounded in (27).

Fig. 6 shows the average system capacity of transmit antenna selection and MIMO versus $m$. The
approximations and bounds in this figure are those given in (35), (36), and the discussions following
them. We can check that the capacity of greedy scheduling for transmit antenna selection increases as
the number of transmit antennas increases while it decreases for MIMO. Thus, greedy scheduling works
well with transmit antenna selection in the presence of a large number of transmit antennas.

Fig. 5. Ergodic capacity versus $n$ for $m = 2, 20$ at $\rho = 5$ dB.

We define the scheduling gain as the increase of average system capacity over that without scheduling.
It can be approximated as

$$E[\log_2(1 + \rho X_{(mK)})] - E[\log_2(1 + \rho X_{(m)})]$$ (37)

$$\approx \log_2\Bigl[\frac{1 + \rho\,(q_{mK} + \gamma)}{1 + \rho\,(q_m + \gamma)}\Bigr]$$ (38)

$$= \log_2\Bigl(1 + \rho\,\frac{q_{mK} - q_m}{1 + \rho\,(q_m + \gamma)}\Bigr)$$ (39)

$$= \Theta\Bigl(\frac{1}{\log m}\Bigr).$$ (40)

The numerically integrated values of (37) for $1 \le m \le 20$ and the approximated values of (38) for
$2 \le m \le 20$ are tabulated in Table I for $-5$ dB $\le \rho \le 10$ dB in 5 dB increments, $n = 1$, and $K = 32$.
Note that the approximations are at most 0.1 bits away from the exact values. Thus, (38) is a good
approximation to (37).

Fig. 6. Average system capacity after scheduling versus $m$ for $n = 1$ and $K = 32$ at $\rho = 5$ dB.
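
The approximated entries of Table I can be reproduced directly from (38). For $n = 1$ the quantile is $q_m = \ln m$, so the approximation has a closed form. The sketch below is a minimal illustration; the function name and its parameters are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329

def scheduling_gain_approx(rho_db, m, K):
    """Approximation (38) of the scheduling gain in bits for n = 1 (q_m = ln m)."""
    rho = 10.0 ** (rho_db / 10.0)
    greedy = 1.0 + rho * (math.log(m * K) + EULER_GAMMA)   # 1 + rho (q_mK + gamma)
    round_robin = 1.0 + rho * (math.log(m) + EULER_GAMMA)  # 1 + rho (q_m + gamma)
    return math.log2(greedy / round_robin)

if __name__ == "__main__":
    for m in (2, 5, 10, 20):
        print(m, [round(scheduling_gain_approx(db, m, 32), 2) for db in (-5, 0, 5, 10)])
```

For example, with $m = 2$, $K = 32$ and $\rho = 5$ dB this evaluates to about 1.67 bits, matching the parenthesized entry in Table I.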

We define the fractional scheduling gain of greedy scheduling to be the greedy scheduling gain as a
fraction of the capacity of round robin scheduling. Since $E[\log_2(1 + \rho X_{(m)})] \approx \log_2(1 + \rho\,(q_m + \gamma))$,
it is easy to see that

$$\frac{E[\log_2(1 + \rho X_{(mK)})] - E[\log_2(1 + \rho X_{(m)})]}{E[\log_2(1 + \rho X_{(m)})]} = \Theta\Bigl(\frac{1}{\log m \,\log(\log m)}\Bigr).$$

VI. Conclusion 

In this work, we considered the use of transmit antenna selection in multiple antenna wireless systems.
It was shown that for antenna selection systems (unlike MIMO systems), the channel hardens at a much
slower rate, and thus significant multiuser scheduling gain can exist. Additionally, it was shown that even
without this scheduling gain, transmit antenna selection systems outperform open loop MIMO systems in
low SINR regimes, particularly for a small number of receive antennas. The implications of these results
for wireless system design were briefly discussed.

 m     -5 dB            0 dB             5 dB             10 dB
 1     0.8084           1.4366           2.0183           2.4095
 2     0.7803 (0.83)    1.2899 (1.34)    1.6826 (1.67)    1.8935 (1.82)
 3     0.7528 (0.78)    1.1958 (1.20)    1.5048 (1.45)    1.6540 (1.56)
 4     0.7308 (0.75)    1.1308 (1.12)    1.3926 (1.33)    1.5119 (1.42)
 5     0.7131 (0.72)    1.0826 (1.06)    1.3139 (1.25)    1.4155 (1.33)
 6     0.6984 (0.70)    1.0449 (1.02)    1.2548 (1.20)    1.3445 (1.27)
 7     0.6861 (0.69)    1.0144 (0.99)    1.2082 (1.15)    1.2895 (1.21)
 8     0.6754 (0.67)    0.9890 (0.96)    1.1702 (1.11)    1.2464 (1.17)
 9     0.6661 (0.66)    0.9673 (0.94)    1.1384 (1.09)    1.2084 (1.14)
10     0.6578 (0.65)    0.9485 (0.92)    1.1113 (1.06)    1.1772 (1.11)
11     0.6504 (0.65)    0.9321 (0.90)    1.0877 (1.04)    1.1502 (1.09)
12     0.6438 (0.64)    0.9174 (0.89)    1.0670 (1.02)    1.1267 (1.07)
13     0.6377 (0.63)    0.9045 (0.88)    1.0486 (1.00)    1.1058 (1.05)
14     0.6321 (0.63)    0.8925 (0.87)    1.0321 (0.99)    1.0872 (1.03)
15     0.6270 (0.62)    0.8816 (0.86)    1.0172 (0.97)    1.0704 (1.02)
16     0.6222 (0.62)    0.8717 (0.85)    1.0035 (0.96)    1.0551 (1.00)
17     0.6178 (0.61)    0.8626 (0.84)    0.9911 (0.95)    1.0411 (0.99)
18     0.6137 (0.61)    0.8541 (0.83)    0.9796 (0.94)    1.0283 (0.98)
19     0.6098 (0.60)    0.8463 (0.82)    0.9690 (0.93)    1.0164 (0.97)
20     0.6062 (0.60)    0.8390 (0.81)    0.9591 (0.92)    1.0054 (0.96)

TABLE I. Exact and approximated (in parentheses) scheduling gain in bits for various $m$ and $\rho$ for $n = 1$
and $K = 32$.



Appendix 

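Proof of Lemma 1: In the first part of the lemma, for $t \ge 0$,

$$\int_t^\infty (1 - F(y))\,dy = \sum_{k=0}^{n-1}\int_t^\infty e^{-y}\frac{y^k}{k!}\,dy = e^{-t}\sum_{k=0}^{n-1}\sum_{i=0}^{k}\frac{t^i}{i!}$$

and thus

$$1 \le R(t) = \frac{\int_t^\infty (1 - F(y))\,dy}{1 - F(t)} = \frac{e^{-t}\sum_{k=0}^{n-1}\sum_{i=0}^{k} t^i/i!}{e^{-t}\sum_{k=0}^{n-1} t^k/k!} = \frac{\sum_{k=0}^{n-1}\sum_{i=0}^{k} t^i/i!}{\sum_{k=0}^{n-1} t^k/k!} < \infty.$$ (43)

From (43), if $n = 1$, $R(t) = 1$, and if $n \ge 2$,

$$R(t) = 1 + (n-1)\frac{1}{t} + O\Bigl(\frac{1}{t^2}\Bigr).$$ (44)

This proves the first part of the lemma.

From (44), $\lim_{t\to\infty} xR(t)/t = 0$ for all real $x$ and thus

$$\lim_{t\to\infty}\frac{1 - F(t + xR(t))}{1 - F(t)} = \lim_{t\to\infty}\frac{e^{-(t + xR(t))}\,(t + xR(t))^{n-1}}{e^{-t}\,t^{n-1}} = \lim_{t\to\infty} e^{-xR(t)}\Bigl(1 + \frac{xR(t)}{t}\Bigr)^{n-1} = e^{-x}.$$ (45)

For the second part, the result follows from (45) and the following theorem whose proof can be found
in [10, Ch. 2]. ■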

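Theorem 7: Define the upper endpoint $\omega(F) = \sup\{x \mid F(x) < 1\}$. Then $F$ is in the domain of
attraction of $G = \exp(-\exp(-x))$ if and only if there exists some finite $a < \omega(F)$ such that

$$\int_a^{\omega(F)} (1 - F(y))\,dy < \infty,$$ (46)

and for all real $x$,

$$\lim_{t\to\omega(F)}\frac{1 - F(t + xR(t))}{1 - F(t)} = e^{-x},$$ (47)

where

$$R(t) = \frac{\int_t^{\omega(F)} (1 - F(y))\,dy}{1 - F(t)}.$$ (48)

Moreover, the normalizing constants $a_m$ and $b_m$ in (10) can be chosen as

$$a_m = \inf\Bigl\{x \;\Big|\; 1 - F(x) \le \frac{1}{m}\Bigr\}$$ (49)

and

$$b_m = R(a_m).$$ (50)

□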

The following lemma in [10, Ch. 2] will also prove useful in the proof of Lemma 2.
Lemma 8: Let $X_m$ be any sequence of random variables, and let $a_m$, $a_m^*$, $b_m > 0$, $b_m^* > 0$ be constants.
Suppose that $\lim_{m\to\infty}\Pr\{X_m < a_m + b_m x\} = G(x)$ for all continuity points of the distribution function $G(x)$.
Then

$$\lim_{m\to\infty}\Pr\{X_m < a_m + b_m x\} = \lim_{m\to\infty}\Pr\{X_m < a_m^* + b_m^* x\} = G(x)$$ (51)

holds if and only if both of the following hold:

$$\lim_{m\to\infty}\frac{a_m - a_m^*}{b_m} = 0,$$ (52)

$$\lim_{m\to\infty}\frac{b_m^*}{b_m} = 1.$$ (53)

□

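Proof of Lemma 2: We prove (14) first. From Lemma 8, the necessary and sufficient condition for $b_m$ is

$$\lim_{m\to\infty}\frac{b_m}{R(q_m)} = 1.$$ (54)

From (9) in Lemma 1 we know that

$$R(q_m) = \begin{cases} 1, & \text{if } n = 1, \\ 1 + (n-1)\frac{1}{q_m} + O\bigl(\frac{1}{q_m^2}\bigr), & \text{if } n \ge 2, \end{cases}$$ (55)

and

$$\lim_{m\to\infty} q_m = \infty.$$ (56)

Thus $\lim_{m\to\infty} b_m$ should be 1. Since the converse is also true, we have proved (14). Because of (14),
(52) in Lemma 8 reduces to

$$\lim_{m\to\infty} (a_m - a_m^*) = 0$$ (57)

and the conditions for $a_m$ and $b_m$ can be separated. Since the reference $a_m^*$ can be chosen to be $q_m$ in
this case, it suffices to prove that

$$\lim_{m\to\infty}\bigl[q_m - \{\ln m + (n-1)\ln(\ln m) - \ln (n-1)!\}\bigr] = 0.$$ (58)

From the definition of $q_m$,

$$e^{-q_m}\sum_{k=0}^{n-1}\frac{q_m^k}{k!} = \frac{1}{m}.$$ (59)

For $n = 1$,

$$q_m = \ln m,$$ (60)

and obviously (60) satisfies (58). Now for $n \ge 2$ and $m$ large enough that $q_m > 1$,

$$\frac{q_m^{n-1}}{(n-1)!} < \sum_{k=0}^{n-1}\frac{q_m^k}{k!} = \frac{q_m^{n-1}}{(n-1)!} + \sum_{k=0}^{n-2}\frac{q_m^k}{k!} \le \frac{q_m^{n-1}}{(n-1)!} + q_m^{n-2}\sum_{k=0}^{n-2}\frac{1}{k!} < \frac{q_m^{n-1}}{(n-1)!} + q_m^{n-2}\,e.$$ (61)

Thus,

$$e^{-q_m}\frac{q_m^{n-1}}{(n-1)!} < \frac{1}{m} < e^{-q_m}\Bigl[\frac{q_m^{n-1}}{(n-1)!} + e\,q_m^{n-2}\Bigr].$$ (62)

Taking the logarithms on both sides of (62) gives

$$-q_m + (n-1)\ln q_m - \ln (n-1)! < -\ln m < -q_m + \ln\Bigl[\frac{q_m^{n-1}}{(n-1)!} + e\,q_m^{n-2}\Bigr] = -q_m + (n-1)\ln q_m - \ln (n-1)! + \ln\Bigl(1 + \frac{e\,(n-1)!}{q_m}\Bigr)$$ (63)

and it yields

$$\ln m < q_m - (n-1)\ln q_m + \ln (n-1)! < \ln m + \ln\Bigl(1 + \frac{e\,(n-1)!}{q_m}\Bigr).$$ (64)

Define $\varepsilon_m$ as

$$q_m = \ln m + (n-1)\ln(\ln m) - \ln (n-1)! + \varepsilon_m$$ (65)

and then we must prove $\lim_{m\to\infty}\varepsilon_m = 0$. From (64),

$$\lim_{m\to\infty}\frac{q_m}{\ln m} = 1 \quad\text{and}\quad \lim_{m\to\infty}\frac{\ln q_m}{\ln m} = 0.$$ (66)

We can see

$$\lim_{m\to\infty}\frac{\varepsilon_m}{\ln m} = 0.$$ (67)

Again, by substituting (65) for the leftmost $q_m$ in (64) and defining $\delta_m = \ln q_m - \ln(\ln m)$,

$$(n-1)\,\delta_m < \varepsilon_m < (n-1)\,\delta_m + \ln\Bigl(1 + \frac{e\,(n-1)!}{q_m}\Bigr).$$ (68)

Because

$$\lim_{m\to\infty}\ln\Bigl(1 + \frac{e\,(n-1)!}{q_m}\Bigr) = 0,$$

we only need to show $\lim_{m\to\infty}\delta_m = 0$. For $m$ such that $q_m \ne \ln m$,

$$\delta_m = \ln q_m - \ln(\ln m) = (q_m - \ln m)\,\frac{1}{c_m}, \quad\text{for some } c_m \in (q_m, \ln m) \text{ or } (\ln m, q_m),$$

$$= \frac{q_m - \ln m}{\ln m + \eta_m\,(q_m - \ln m)}, \quad\text{for some } \eta_m \in (0, 1),$$

$$= \frac{(n-1)\ln(\ln m) - \ln (n-1)! + \varepsilon_m}{\ln m + \eta_m\{(n-1)\ln(\ln m) - \ln (n-1)! + \varepsilon_m\}}$$ (69)

by the mean value theorem. From (67) and (69),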



$$\lim_{m\to\infty}\delta_m = 0.$$ (70)

■

Proof of Theorem 4: First, note that we borrow some of the proof techniques from Hall's paper
[12] and Galambos's book [10, Sec. 2.10]. We see that $\alpha_m$ satisfies

$$\alpha_m - (n-1)\ln\alpha_m + \ln (n-1)! = \ln m.$$ (71)

As in the proof of Lemma 2, it can be shown that

$$\lim_{m\to\infty}\bigl[\alpha_m - \{\ln m + (n-1)\ln(\ln m) - \ln (n-1)!\}\bigr] = 0.$$ (72)

Then, we can express the general normalizing constants $a_m$ and $b_m$ as

$$a_m = \alpha_m + \delta_m \quad\text{and}\quad b_m = 1 + \varepsilon_m,$$ (73)

where $\delta_m$ and $\varepsilon_m$ are sequences satisfying

$$\lim_{m\to\infty}\delta_m = 0 \quad\text{and}\quad \lim_{m\to\infty}\varepsilon_m = 0.$$ (74)

For $n = 1$, (21) and the choice of (19) and (20) are proved in [10, p. 142] because the chi-square
distribution just becomes the exponential distribution. Hence, let us assume $n \ge 2$, and define

$$z_m(x) = m\,[1 - F(a_m + b_m x)].$$ (75)

We will shortly prove that $z_m(x) \to e^{-x}$ as $m \to \infty$, but assuming that this is the case, from (75),

$$F^m(a_m + b_m x) = \Bigl(1 - \frac{z_m(x)}{m}\Bigr)^m$$ (76)

and by the triangle inequality,

$$|e^{-z_m(x)} - G(x)| - |F^m(a_m + b_m x) - e^{-z_m(x)}| \le |F^m(a_m + b_m x) - G(x)| \le |F^m(a_m + b_m x) - e^{-z_m(x)}| + |e^{-z_m(x)} - G(x)|.$$ (77)

Also from [10, p. 8], for any $z \in (0, 1/2)$,

$$e^{-mz} - (1 - z)^m\bigl[e^{2mz^2} - 1\bigr] \le (1 - z)^m \le e^{-mz}.$$ (78)


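For fixed $x$, because $z_m(x) \to e^{-x}$, $z_m(x)/m \in (0, 1/2)$ for large enough $m$. By (78) with $z = z_m(x)/m$,

$$|F^m(a_m + b_m x) - e^{-z_m(x)}| \le \Bigl(1 - \frac{z_m(x)}{m}\Bigr)^m\Bigl[\exp\Bigl(\frac{2 z_m^2(x)}{m}\Bigr) - 1\Bigr] \le e^{-z_m(x)}\Bigl[\exp\Bigl(\frac{2 z_m^2(x)}{m}\Bigr) - 1\Bigr].$$ (79)

The rate of convergence of $|F^m(a_m + b_m x) - e^{-z_m(x)}|$ is dominated by the $1/m$ term, which will turn out
to be much faster than that of $|e^{-z_m(x)} - G(x)|$. Now, consider the rate of convergence of $|e^{-z_m(x)} - G(x)|$.

From the definition of $F(x)$ in (6), we can easily see that as $x \to \infty$,

$$1 - F(x) = e^{-x}\frac{x^{n-1}}{(n-1)!}\Bigl[1 + \frac{n-1}{x} + O\Bigl(\frac{1}{x^2}\Bigr)\Bigr].$$ (80)

From (71) and noting that $\delta_m, \varepsilon_m, 1/\alpha_m \to 0$ as $m \to \infty$,

$$e^{-(a_m + b_m x)}\frac{(a_m + b_m x)^{n-1}}{(n-1)!} = e^{-\alpha_m}\frac{\alpha_m^{n-1}}{(n-1)!}\,e^{-x}\,e^{-(\delta_m + \varepsilon_m x)}\Bigl[1 + \frac{\delta_m + (1 + \varepsilon_m)x}{\alpha_m}\Bigr]^{n-1}$$

$$= \frac{1}{m}\,e^{-x}\bigl[1 - (\delta_m + \varepsilon_m x) + O(\delta_m^2 + \varepsilon_m^2)\bigr]\Bigl[1 + (n-1)\frac{x}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2} + \delta_m^2 + \varepsilon_m^2\Bigr)\Bigr]$$

$$= \frac{1}{m}\,e^{-x}\Bigl[1 - (\delta_m + \varepsilon_m x) + \frac{n-1}{\alpha_m}\,x + O\Bigl(\frac{1}{\alpha_m^2} + \delta_m^2 + \varepsilon_m^2\Bigr)\Bigr]$$ (81)

and

$$1 + \frac{n-1}{a_m + b_m x} + O\Bigl(\frac{1}{(a_m + b_m x)^2}\Bigr) = 1 + \frac{n-1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2} + \delta_m^2 + \varepsilon_m^2\Bigr).$$ (82)

Because $1 - F(a_m + b_m x)$ is equal to the product of (81) and (82),

$$1 - F(a_m + b_m x) = \frac{1}{m}\,e^{-x}\Bigl[1 - (\delta_m + \varepsilon_m x) + \frac{n-1}{\alpha_m}(1 + x) + O\Bigl(\frac{1}{\alpha_m^2} + \delta_m^2 + \varepsilon_m^2\Bigr)\Bigr].$$ (83)

From this, it is clear that as $m \to \infty$,

$$z_m(x) = m\,[1 - F(a_m + b_m x)] \to e^{-x}.$$ (84)

Using the fact that $G(x) = \exp(-e^{-x})$,

$$|e^{-z_m(x)} - G(x)| = G(x)\,\bigl|\exp(e^{-x} - z_m(x)) - 1\bigr|$$

$$= G(x)\,\bigl|(e^{-x} - z_m(x)) + O\bigl((e^{-x} - z_m(x))^2\bigr)\bigr|$$

$$= G(x)\,e^{-x}\,\Bigl|(\delta_m + \varepsilon_m x) - \frac{n-1}{\alpha_m}(1 + x) + O\Bigl(\frac{1}{\alpha_m^2} + \delta_m^2 + \varepsilon_m^2\Bigr)\Bigr|.$$ (85)

Obviously, to cancel out $(n-1)(x+1)/\alpha_m$,

$$\delta_m = (n-1)\frac{1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr)$$ (86)

and

$$\varepsilon_m = (n-1)\frac{1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr).$$ (87)

Therefore, (19) and (20) must be satisfied to optimize the rate of convergence. Moreover, if $\delta_m$ and $\varepsilon_m$
are chosen as (86) and (87), by the second order expansion, it can be shown that the terms of $O(1/\alpha_m^2)$
cannot be canceled out. Thus

$$|e^{-z_m(x)} - G(x)| = \Theta\Bigl(\frac{1}{\alpha_m^2}\Bigr).$$ (88)

Because

$$\alpha_m = \ln m + (n-1)\ln(\ln m) - \ln (n-1)! + o(1),$$ (89)

$1/\alpha_m^2 \to 0$ is much slower than $1/m \to 0$ as $m \to \infty$. Hence, combining (88) and (79) into (77) yields

$$|F^m(a_m + b_m x) - G(x)| = \Theta\Bigl(\frac{1}{\alpha_m^2}\Bigr).$$ (90)

From (89),

$$|F^m(a_m + b_m x) - G(x)| = \Theta\Bigl(\frac{1}{(\log m)^2}\Bigr).$$ (91)

■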



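Proof of Corollary 5: For $n = 1$, $a_m = q_m = \ln m$ and $b_m = 1$. Therefore, (19) and (20) hold. Let
us now take $n \ge 2$. From the definition of $\alpha_m$ and $q_m$,

$$\frac{1}{m} = e^{-\alpha_m}\frac{\alpha_m^{n-1}}{(n-1)!} = e^{-q_m}\sum_{k=0}^{n-1}\frac{q_m^k}{k!} = e^{-q_m}\frac{q_m^{n-1}}{(n-1)!}\Bigl[1 + \sum_{k=1}^{n-1}\frac{{}_{n-1}P_k}{q_m^k}\Bigr],$$ (92)

where ${}_{n-1}P_k = (n-1)!/(n-1-k)!$. By taking logarithms,

$$-\alpha_m + (n-1)\ln\alpha_m - \ln (n-1)! = -q_m + (n-1)\ln q_m - \ln (n-1)! + \ln\Bigl(1 + \sum_{k=1}^{n-1}\frac{{}_{n-1}P_k}{q_m^k}\Bigr).$$ (93)

Define $\varepsilon_m$ as $q_m = \alpha_m + \varepsilon_m$. We can see that $\alpha_m \to \infty$ and $\varepsilon_m \to 0$ as $m \to \infty$ because of (72).
Substituting $\alpha_m + \varepsilon_m$ for $q_m$ yields

$$\varepsilon_m = (n-1)\ln\Bigl(1 + \frac{\varepsilon_m}{\alpha_m}\Bigr) + \ln\Bigl(1 + \sum_{k=1}^{n-1}\frac{{}_{n-1}P_k}{(\alpha_m + \varepsilon_m)^k}\Bigr).$$ (94)

Define

$$\beta_m = \sum_{k=1}^{n-1}\frac{{}_{n-1}P_k}{(\alpha_m + \varepsilon_m)^k}.$$ (95)

It is obvious that $\varepsilon_m/\alpha_m$ and $\beta_m \to 0$ as $m \to \infty$. Thus for large enough $m$,

$$\varepsilon_m = (n-1)\Bigl[\frac{\varepsilon_m}{\alpha_m} + O\Bigl(\Bigl(\frac{\varepsilon_m}{\alpha_m}\Bigr)^2\Bigr)\Bigr] + \beta_m + O(\beta_m^2).$$ (96)

However, since

$$\beta_m = \frac{n-1}{\alpha_m + \varepsilon_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr) = (n-1)\frac{1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr),$$ (97)

(96) becomes

$$\varepsilon_m = (n-1)\frac{1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr).$$ (98)

Then

$$q_m = \alpha_m + \varepsilon_m = \alpha_m + (n-1)\frac{1}{\alpha_m} + O\Bigl(\frac{1}{\alpha_m^2}\Bigr).$$ (99)

Clearly, $q_m$ satisfies (19). We can obtain (20) by substituting (19) for $q_m$ in (23). ■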

Proof of Theorem 6: We first introduce a simple convex ordering result by van Zwet [11, Ch. 2].
Let $X$ and $Y$ be arbitrary random variables, whose cdfs are $F_X$ and $F_Y$ respectively. Van Zwet
showed that if $F_Y^{-1}(F_X(x))$ is convex, then $F_X(E[X]) \le F_Y(E[Y])$ and $F_X(E[X_{(m)}]) \le F_Y(E[Y_{(m)}])$,
provided the expectations exist. If $F_Y^{-1}(F_X(x))$ is concave, then the inequalities are reversed. Now, we
return to the chi-square distribution, where the random variable $X$ follows the cdf $F(x)$ in (6), and
consider a random variable $Y$ with the cdf $F_Y(y) = -1/y$ ($-\infty < y \le -1$). We can easily see that

$$F_Y(E[Y_{(m)}]) = 1 - \frac{1}{m}.$$ (100)

Because $F_Y^{-1}(F(x)) = -1/F(x)$, we only need to show that $1/F(x)$ is convex. If so, $-1/F(x)$ is then
concave and

$$E[X_{(m)}] \ge F^{-1}\Bigl(1 - \frac{1}{m}\Bigr) = q_m.$$ (101)

Then, the lower bound of (26) will be proved. Because $F(0) = 0$, we can assume $x > 0$. It will be
sufficient to show that

$$\frac{d^2}{dx^2}\Bigl[\frac{1}{F(x)}\Bigr] = -\frac{f'(x)F(x) - 2\{f(x)\}^2}{\{F(x)\}^3} \ge 0$$ (102)



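for $x > 0$. This can be verified explicitly when $n = 1$. For $n \ge 2$, it follows from

$$\frac{f'(x)F(x)}{\{f(x)\}^2} = \frac{n-1}{n}\Bigl[1 - \frac{n!}{n-1}\sum_{k=1}^{\infty}\frac{k+1}{(k+n)!}\,x^k\Bigr] \le \frac{n-1}{n} < 2.$$ (103)

We now show the upper bound of (26). If a random variable $Y$ has a cdf $F_Y(y) = 1 - e^{-y}$ ($0 \le y < \infty$),
then

$$E[Y_{(m)}] = \sum_{k=1}^{m}\frac{1}{k}.$$ (104)

For $x > 0$, define

$$h(x) = F_Y^{-1}(F(x)) = -\ln[1 - F(x)].$$ (105)

Then, for $n = 1$, $h''(x) = 0$, and for $n \ge 2$,

$$h''(x) = \frac{e^{-x}x^{n-2}/(n-2)!}{1 - F(x)}\cdot\frac{1 + \sum_{k=1}^{n-2}\frac{n-1-k}{n-1}\frac{x^k}{k!}}{\sum_{k=0}^{n-1}\frac{x^k}{k!}} \ge 0.$$ (106)

Therefore, by convex ordering,

$$F(E[X_{(m)}]) \le F_Y(E[Y_{(m)}]) = 1 - \exp\Bigl(-\sum_{k=1}^{m}\frac{1}{k}\Bigr) = 1 - \frac{1}{m+1}\exp\Bigl[-\Bigl(\sum_{k=1}^{m}\frac{1}{k} - \ln(m+1)\Bigr)\Bigr].$$ (107)

For $m \ge 1$, we can easily show that $\sum_{k=1}^{m}\frac{1}{k} - \ln(m+1)$ is increasing as a function of $m$ and by the
definition of $\gamma$ as Euler's constant,

$$\lim_{m\to\infty}\Bigl[\sum_{k=1}^{m}\frac{1}{k} - \ln(m+1)\Bigr] = \lim_{m\to\infty}\Bigl[\sum_{k=1}^{m+1}\frac{1}{k} - \ln(m+1)\Bigr] - \lim_{m\to\infty}\frac{1}{m+1} = \gamma.$$ (108)

Thus $\sum_{k=1}^{m}\frac{1}{k} - \ln(m+1) < \gamma$ and it yields

$$E[X_{(m)}] \le F^{-1}\Bigl(1 - \exp\Bigl(-\sum_{k=1}^{m}\frac{1}{k}\Bigr)\Bigr) \le F^{-1}\Bigl(1 - \frac{1}{m+1}\,e^{-\gamma}\Bigr) = q_{e^\gamma(m+1)}.$$ (109)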



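Hence, (26) is proved. The upper bound of (27) can be deduced from the upper bound of (26) by Jensen's
inequality because $\rho > 0$ and $\log_2(1 + \rho x)$ is a concave function. Now, only the proof for the
lower bound of (27) remains. Define $Z = \log_2(1 + \rho X)$. Then $Z_{(m)} = \log_2(1 + \rho X_{(m)})$. The cdf of $Z$ is

$$F_Z(z) = F\Bigl(\frac{2^z - 1}{\rho}\Bigr).$$ (110)

We will show that $1/F_Z(z)$ is convex. Let $1/F(x)$ be $Q(x)$ and then

$$\frac{d^2}{dz^2}\Bigl[\frac{1}{F_Z(z)}\Bigr] = \Bigl[Q''\Bigl(\frac{2^z - 1}{\rho}\Bigr)\frac{2^z}{\rho} + Q'\Bigl(\frac{2^z - 1}{\rho}\Bigr)\Bigr]\frac{2^z}{\rho}\,(\ln 2)^2.$$ (111)

If we substitute $x$ for $(2^z - 1)/\rho$, so that $2^z/\rho = x + 1/\rho$, then we only need to show

$$Q''(x)\Bigl(x + \frac{1}{\rho}\Bigr) + Q'(x) \ge 0,$$ (112)

for $x > 0$, and (112) becomes

$$\frac{1}{\{F(x)\}^3}\Bigl[\bigl[-f'(x)F(x) + 2\{f(x)\}^2\bigr]\Bigl(x + \frac{1}{\rho}\Bigr) - f(x)F(x)\Bigr] \ge \frac{1}{\{F(x)\}^3}\Bigl[\bigl[-f'(x)F(x) + 2\{f(x)\}^2\bigr]\,x - f(x)F(x)\Bigr],$$ (113)

because $-f'(x)F(x) + 2\{f(x)\}^2 \ge 0$ by (103). We claim

$$\bigl[-f'(x)F(x) + 2\{f(x)\}^2\bigr]\,x - f(x)F(x) \ge 0.$$ (114)

This is because

$$\bigl[-f'(x)F(x) + 2\{f(x)\}^2\bigr]\,x - f(x)F(x) = e^{-2x}\frac{x^{n-1}}{(n-1)!}\Bigl[(x - n)\Bigl(e^x - \sum_{k=0}^{n-1}\frac{x^k}{k!}\Bigr) + \frac{2x^n}{(n-1)!}\Bigr]$$

$$= e^{-2x}\frac{x^{n-1}}{(n-1)!}\Bigl[\frac{x^n}{(n-1)!} + \sum_{k=n+1}^{\infty}\frac{(k-n)\,x^k}{k!}\Bigr] \ge 0.$$ (115)

Thus $-1/F_Z(z)$ is concave. By the convex ordering with the cdf $F_Y(y) = -1/y$ ($-\infty < y \le -1$),

$$F_Z(E[Z_{(m)}]) = F\Bigl(\frac{2^{E[Z_{(m)}]} - 1}{\rho}\Bigr) \ge 1 - \frac{1}{m},$$ (116)

$$\frac{2^{E[Z_{(m)}]} - 1}{\rho} \ge F^{-1}\Bigl(1 - \frac{1}{m}\Bigr) = q_m,$$ (117)

and therefore

$$C(\rho) = E[Z_{(m)}] \ge \log_2(1 + \rho\, q_m).$$ (118)

■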